9.2 Motivation¶

Three advantage of convolution * Sparse interpretation * Parameter Sharing * Equivariant Representation

Sparse Interpretation¶

Traditional Matrix Multiplication: Every output unit interact with every inout unit.
Convolution: Sparse connectivity

We need to store fewer parameters, which improves

Memory requirement
statistical efficiency

This allows the network to efficiently describe complicated interaction between many variables by construction such interaction from simple building blocks that each describe only sparse interaction.

Equivariance to Translation¶

If the input changes, the output changes in the same way. Specifically, a function f(x) is equivariant to function g if \(f(g(x)) = g(f(x))\). Eg:

g: shift image
f: convolution

When processing time-serise data: convolution produces a sort of timeline that shows when different features appear in the input. If we move an event later in time in the input, the exact same representation of it will appear in the outout

Convolution is not natually quivariant to some other transformation such as changes in scale or rotation of an image.

Enables processing flexible shaped data¶

Discussed in 9.7

Resource¶

cs231n Assignment 2 Conv Net

9.2 Motivation¶

Sparse Interpretation¶

Parameter sharing¶

Equivariance to Translation¶

Enables processing flexible shaped data¶

Resource¶